CDS

Accession Number TCMCG010C09452
gbkey CDS
Protein Id XP_016562646.1
Location complement(join(3520018..3520161,3520260..3520403,3520486..3520557,3520659..3520727,3520820..3520906,3521386..3521491,3532015..3532134,3532229..3532305,3532389..3532572,3532665..3532772,3535972..3536040,3536732..3537576))
Gene LOC107861790
GeneID 107861790
Organism Capsicum annuum

Protein

Length 674aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA319678
db_source XM_016707160.1
Definition PREDICTED: protein PAF1 homolog [Capsicum annuum]

EGGNOG-MAPPER Annotation

COG_category K
Description Paf1
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko03021        [VIEW IN KEGG]
KEGG_ko ko:K15174        [VIEW IN KEGG]
EC -
KEGG_Pathway ko04011        [VIEW IN KEGG]
map04011        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCTTCGTATAGGCCATTCCCTCCACCATCTCAGTCGAGTTTTGTTCCGCCGCCACCCCCGCCGCAGAATCAGAATCCTCCGCCACCGCCGCCGTCTCAATCGAGGGGAAGTCAGTATTCGCAGAATTGGGGTTATGATGGATCTTCGTATTATCAGCATCCTGGTTACGTTCCTCCACCTCCTCCTCCGGGGAGGAGTCAGTATCAGCCACCGCCTCCGCCTGATTCTTCGTATCCACCTCCGCCACCACCTTCCGGGCAACCTCCTCCACCTCCTGCTCCAATGTATTATCCGTCTTCGCAGTATTCGCAGTATAGTCAAAACCAGCCTTTAGAGCCTCCACCTCCACCTCCTCCTTCGTCTCCTCCGAGTTCATCTATTCCTCCTCCTCCACCGCCTTCTCAACCACCGTCCCCTCCGCCACCTCCTTCGTCAGCTCCACCTCCAAGCCAACGTAATGAAAGTAGGCCTAGTGGAGAGAAAAAGCGAGAATCTGGTTGGCGTGAATCAGGGCATCGGTCTAAACAGCCAGGGCATTCAGTTCCTCCATTGCCAGTGAAGAAAGCTAATGCTCCTTCAGGGAGGGTTGAGACTGAGGAAGAGAGGAGGTTAAGGAAGAAGAGAGAGTTCGAAAAGCAAAGGCATGAAGAGAAGCATAGGCAGCAATTAAAGGAATCACAAAATAGAGTGCTGCAGAAAACCCAAATGTTGGCTTCTGGTATGAAGGGTCATGGGTCAATTAGTGCATCGCATATGGCTGACAGAAGAACTGCCCCTTTGCTAAGTGGTGAGAGGACAGAAAACCGGTTAAAGAAGCCGACAACATTTCTTTGCAAGTTAAAATTCAGAAATGAATTACCAGATCCGACAGCTCAACCAAAGCTTTTGACTTTAAGAAGAGACCCAGATCGCTTCGCGAAATACGCAATTACCTCATTGGAGAAAATGCACAAGCCTCAACTATATGTTGAACCAGACCTTGGAATTCCGCTTGACCTTCTTGATCTCAGTGTGTACAATCCTCCCAAGGGTGTAAAGATACCACTTGCTCCAGAAGATGAAGAGTTGTTGCGTGATGATGAGCCTATAACCCCCATCAAGAAAGATGGCATAAAAAAGAAGGAAAGACCAACTGACAAAGGTGTTTCTTGGCTGGTCAAAACACAATACATCTCTCCTCTTAGCACGGAGTCAGCAAAACAGTCTCTGACTGAAAAGCAAGCTAAAGAATTGCGAGAAAATAGAGGCGGCCGCAACATCTTGGAGAATCTTAACAATAGAGATAGACAAATTCAAGAGATCAAGGCATCTTTTGAGGCATGCAAGTTGCGGCCCATTCATGCGACCAATCACAGATTGCGGCCAGTCAAAGTCCAGCCACTTTTCCCCGACTTTGATCGGTATATGGACCAGTTTGTGCTTGCGAATTTTGATAGTGCTCCAACTGCTGATTCAGAAACCTACAACAAGTTGGATAAAACTGTTCGTGATGCATGCGAATCACAGGCCATTATGAAAAGCTTTGTGGCTACAGGCTCAGATGCAGATAAACCTGACAAATTTCTGGCATATATGGCCCCTGCTCCAAATGAGCTATCGAAGGATATGTATGATGAAAACGAGGATATCTCATACTCTTGGGTTCGGGAGTATCACTGGGATGTACGAGGTGATGACGTAGATGATCCTACTACATACGTTGTGGCATTTGGTGAAACAGAGGCCTGTTACATGCCTCTTCCAACAAAGCTTGTTTTGAGGAAAAAAAGAGCTAGAGAGGGGAAATCAAATGATGAAGTCGAACATTTCCCAGTTCCCTCGAGAGTTACAGTGAGGAAGAGACCAACTGTAGCTGCTATTGAACTGAAAGAAGAAGGGGGTTATACAACAGCTTTGAAGGGGAGTGTGTCAAGTTCCAAGAGAACAAGAATCGGTCATGAAGATGCTGTCGACGAACAGCTGCTTGATATGCATGATGGTGATCAAGATCAGTCGAGTGGTGGCGAGTACATGTCTGATTGA
Protein:  
MASYRPFPPPSQSSFVPPPPPPQNQNPPPPPPSQSRGSQYSQNWGYDGSSYYQHPGYVPPPPPPGRSQYQPPPPPDSSYPPPPPPSGQPPPPPAPMYYPSSQYSQYSQNQPLEPPPPPPPSSPPSSSIPPPPPPSQPPSPPPPPSSAPPPSQRNESRPSGEKKRESGWRESGHRSKQPGHSVPPLPVKKANAPSGRVETEEERRLRKKREFEKQRHEEKHRQQLKESQNRVLQKTQMLASGMKGHGSISASHMADRRTAPLLSGERTENRLKKPTTFLCKLKFRNELPDPTAQPKLLTLRRDPDRFAKYAITSLEKMHKPQLYVEPDLGIPLDLLDLSVYNPPKGVKIPLAPEDEELLRDDEPITPIKKDGIKKKERPTDKGVSWLVKTQYISPLSTESAKQSLTEKQAKELRENRGGRNILENLNNRDRQIQEIKASFEACKLRPIHATNHRLRPVKVQPLFPDFDRYMDQFVLANFDSAPTADSETYNKLDKTVRDACESQAIMKSFVATGSDADKPDKFLAYMAPAPNELSKDMYDENEDISYSWVREYHWDVRGDDVDDPTTYVVAFGETEACYMPLPTKLVLRKKRAREGKSNDEVEHFPVPSRVTVRKRPTVAAIELKEEGGYTTALKGSVSSSKRTRIGHEDAVDEQLLDMHDGDQDQSSGGEYMSD